Title: Speaker Adaptation of Hidden Markov Models Using Maximum Likelihood Linear Regression. Author: Supervisors
نویسنده
چکیده
Material and results from the current thesis may be used freely provided that the source is stated. Abstract The work presented in this report focuses on an essential problem when doing speaker adaptation; namely how eeectively the speaker speciic information in the adaptation data is used. In the project a system has been implemented for speaker adaptation of hidden Markov models (HMM's) using the Maximum Likelihood Linear Regression (MLLR) method. MLLR is a method that transforms mixture components of HMM's by multiplying the mean vectors with a transformation matrix. It introduces the concept of regression classes as a set of mixture components that are transformed similarly. The adaptation technique is implemented in C. The data used in the tests are taken from the Danish EU-ROM.1 database. All results are averaged over ten speakers. Three issues have been addressed: 1) the eeect of varying the amount of adaptation material, 2) the eeect of using diierent regression class divisions and 3) the importance of the phonetic content in the adaptation material. Tests show that the MLLR technique is very data eeective. Only 3s of speech is needed when a diagonal transformation matrix is used before a positive eeect of the adaptation is seen. When using a full matrix 5s are suucient. It was observed that there is a high dependence between the achieved performance and the regression class division. Transforming each mono-phone individually improves the phoneme cor-rectness from 50.9% for the initial speaker independent models to 58.8% for the adapted models. Based on several approaches it was concluded that there are diierences in the speaker speciic information available from diierent phonemes. Vowels were seen to vary a lot from speaker to speaker and to have relatively much innuence on the eeect of the adaptation. Fricatives on the other hand have very small inter speaker variances. The project addresses the problem of speaker adaptation of hidden Markov models using the Maximum Likelihood Linear Regression technique. Some prior knowledge of the theory of Markov models is assumed. The report is divided into four parts. The Preliminaries part contains the introductory sections including the deenition of the project. In the Theory part theory, techniques and methods are described. The implementation and tests are described in the Implementation, test and conclusion part. The last part of the report is the Appendix where additional information is given. Developed software is included on a oppy disk. References to …
منابع مشابه
The Speaker Adaptation of an Acoustic Model
This paper deals with several adaptation techniques, which are of the importance in cases when the identity of a speaker is known and we want to recognize his speech. We are using three different methods, namely Maximum Apriori Probability adaptation, Maximum Likelihood Linear Regression and Constrained Maximum Likelihood Linear Regression. Each of the methods yields various benefits, therefore...
متن کاملImprovement of MLLR Speaker Adaptation Using a Novel Method
This paper presents a technical speaker adaptation method called WMLLR, which is based on maximum likelihood linear regression (MLLR). In MLLR, a linear regression-based transform which adapted the HMM mean vectors was calculated to maximize the likelihood of adaptation data. In this paper, the prior knowledge of the initial model is adequately incorporated into the adaptation. A series of spea...
متن کاملAdaptation of acoustic models for multilingual recognition
This paper evaluates the recognition performance of a system using acoustic models transformed across language boundaries. Parameters of hidden Markov models (HMMs) trained on speaker independent English data are adapted using Afrikaans adaptation data to realise speaker dependent, multispeaker and speaker independent Afrikaans models. Adaptation is performed using maximum a posteriori probabil...
متن کاملQuasi-Bayes linear regression for sequential learning of hidden Markov models
This paper presents an online/sequential linear regression adaptation framework for hidden Markov model (HMM) based speech recognition. Our attempt is to sequentially improve speaker-independent speech recognition system to handle the nonstationary environments via the linear regression adaptation of HMMs. A quasi-Bayes linear regression (QBLR) algorithm is developed to execute the sequential a...
متن کاملDiscriminative speaker adaptation with conditional maximum likelihood linear regression
We present a simplified derivation of the extended Baum-Welch procedure, which shows that it can be used for Maximum Mutual Information (MMI) of a large class of continuous emission density hidden Markov models (HMMs). We use the extended Baum-Welch procedure for discriminative estimation of MLLR-type speaker adaptation transformations. The resulting adaptation procedure, termed Conditional Max...
متن کامل